Implications of energy declination for speech synthesis
Authors
Abstract
This paper examines whether observed phenomena in energy declination can be used to improve the naturalness of synthetic speech. Two production experiments analyse different aspects of intensity fall-off within utterances, including degree of stress, phrase length, and phrase boundaries. Energy manipulation was carried out using diphone synthesis as the basis for generating stimuli for perception tests in English and Danish. The results of the listening experiments, in which different versions of a paragraph were ranked for naturalness, indicate that amplitude differences can contribute to greater naturalness. However, fine-tuning of amplitude evidently requires good-quality synthesis at the more basic prosodic levels.
Similar resources
F0 declination in spontaneous Estonian: implications for pitch-related preplanning in speech production
This study contributes to the discussion on pitch-related preplanning in spontaneous speech production. It investigates the relationship of phrasal length with declination slope, and the initial and final F0 height in intonation phrases extracted from a corpus of Estonian dialogues. The analysis is based on data from 10 speakers. The results show that the declination in shorter phrases is steep...
Study and quantification of the declination for the Arabic speech synthesis system PARADIS
Modeling melody in a text-to-speech system is indispensable for achieving good synthesis quality and approaching naturalness. The study of melody generally includes the analysis of local melodic events related to the accent and of the declination of the global melody contour of an utterance. In this paper, we present an experimental study of the declination phenomenon co...
A study of F0 declination in Japanese: towards a discourse model of prosodic structure
This study investigates F0 declination as a global-level prosodic phenomenon, establishing a new discourse-based model of prosodic structure in Japanese. The model includes two levels of declination in a hierarchical order: utterance units and prosodic paragraphs, a higher level of declination consisting of embedded declinations. Comparing and contrasting three types of discourse -read speech, ...
Tone Distribution and Its Effect on Subglottal Pressure during Speech
The current work is part of a project to characterize the subglottal pressure (Ps) contour associated with a spoken utterance in terms of the distribution of pitch accents and of phrase and boundary tones. It is found that the nuclear pitch accent does not define the start of the termination phase; the utterance offset is a better marker. Declination rate of the working phase and its relation t...
Classification of Thai Tone Sequences in Syllable-Segmented Speech Using the Analysis-by-Synthesis M - Speech and Audio Processing, IEEE Transactions on
Tone classification is important for Thai speech recognition because tone affects the lexical identification of words. An analysis-by-synthesis algorithm for classifying Thai tones in syllable-segmented speech is presented that uses an extension to Fujisaki’s model for tone languages that incorporates tonal assimilation and declination. The classifier correctly identifies all of the tones in 89....